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Specification 
Title of the Invention 
internet Home Page Data Acquisition Method 

Pi-Knr-'i"^ nf thp invention 

The present invention relates to an Internet 
home page data acquisition method of automatically 
acquiring browsed Internet home page data, i . e . , the 
hypertext data of WWW (World Wide Web) home pages and, 
more particularly, to an Internet home page acquisition 
method which can actually shorten the line connection 
time in a communication path that occupies a telephone 

or portable phone line. 

Conventionally, acquisition of all Internet 
15 home page data has been managed on the client side. 

cache management has been performed on the server side 
as well as on the client side. More specifically, home 
page data accessed in the past are stored in a cache on 
the server side to reduce the actual amount of access to 

20 the WWW server. 

As an example of this technique, a method is 
disclosed in "Data Transfer System" in Japanese Patent 
Laid-open No. 10-21174 (reference 1), which is not 
limited to a WWW server and designed to suppress an 
increase in communication time when a client on a LAN 
(Local Area Network) requests data stored in a data 
server connected to a WAN (Wide Area Network) and the 
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requested data is transferred. According to reference 1, 
in response to a data request from the client to the 
data server, the cache server transfers the requested 
data to the client if the data is stored in the cache 
server. If the data is not stored in the cache server, 
a data request is sent to the data server, and the data 
is transferred to the client through the cache server. 

In an environment in which a user browses data 
in a WWW server by tracing URLs (Uniform Resource 
Locators) one by one as in Internet surfing, the method 
disclosed in "Internet Home Page Management System", 
Japanese Patent Laid-Open No. 10-240604 (reference 2) 
has been proposed. According to reference 2, in order 
to reduce the amount of access to the network, the WWW 
15 server transmits only a changed portion of home page 
data in the WWW server. 

As disclosed in "Automatic Hypertext 
Acquisition Apparatus", Japanese Patent Laid-Open 
No. 10-207759 (reference 3), a method has been proposed, 
by which a user can efficiently and automatically 
acquire effective pages in a small cache area in a 
portable terminal or the like. In the method disclosed 
in reference 3, hypertext data are analyzed to trace 
links to the original hypertext data in accordance with 
the link tags of the hypertext data so as to store, in a 
cache, files that are likely to be downloaded, before 
the user traces the links, thereby shortening the time 
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required for connection 



to the network. 



The following 



problem is, however, posed in 



the conventional method 



of managing acquisition of all 



Internet home page data 



on the client side. 



5 



A client uses 



a method of connecting to URL 



addresses one by one and 



accepting data. For this 



reason, if a WWW server in which a target URL exists is 
congested, the client has difficulty in accessing the 
WWW server. In this case, the client keeps waiting for 

10 acceptance of home page data until a timeout occurs. If, 
therefore, the number of congested WWW servers increases, 
much time is required to sequentially download the home 
page data of all necessary URL addresses. As a 
consequence, if connection is made through a telephone 

15 line or the like, the occupancy of the line undesirably 
increases . 



hypertext acquisition apparatus disclosed in reference 3. 
According to the method disclosed in reference 3, the 

20 occupancy of a line can be decreased as compared with 

the method of making a user obtain target information by 
tracing hypertext links. If, however, files are 
downloaded by the method in reference 3, even files that 
are not necessary for the user are automatically 

25 downloaded into the client, resulting in an increase in 
traffic on the network. For this reason, the load on 
the server increases because of the processing performed 



A 



similar problem arises 



in the automatic 
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to shorten the line connection time. This delays a 
response from the server. As a consequence, the time 
taken for connection to the network is prolonged. 

Furthermore, since the function of 
5 transmitting only updated portion of home page data from 
the WWW server side is lost, the connection time cannot 
be shortened. 
Summary of the Invention 

It is an object of the present invention to 
10 provided an Internet home page data acquisition method 
which can shorten the unnecessary wait time in 
connecting to a WWW server. 

In order to achieve the above object, 
according to the present invention, there is provided an 
15 Internet home page data acquisition method comprising 
the steps of transferring an access list held by a 
client to a cache URL address storage server, acquiring 
home page data by a cache server on the basis of the 
transferred access list, and transferring the acquired 
20 home page data from the cache server to the client upon 
completion of acquisition of the home page data. 
Rr-if^^f Dg^scriotion of the Drawings 

Fig. 1 is a view showing the arrangement of a 
network system to which an Internet home page data 
25 acquisition method according to the first embodiment of 
the present invention is applied; 

Fig. 2 is a view showing the arrangement of a 
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network system to which an Internet home page data 
acquisition method according to the second embodiment of 
the present invention is applied; 

Fig. 3 is a block diagram showing the internal 
5 arrangement of a cache server in Fig. 2; 

Fig. 4 is a block diagram showing the internal 
arrangement of a cache URL address storage server in 
Fig. 2; 

Fig. 5 is a view showing a folder structure 
10 for managing an access list on the client side; 

Fig. 6 is a view showing management 
information on the cache URL address storage server 
side; 

Fig. 7 is a flow chart for explaining the 
15 operation of the network system in Fig. 2; 

Fig. 8 is a view showing an Internet window to 
explain how URL addresses are set from a WWW home page 
according to the third embodiment of the present 
invention; and 

20 Fig. 9 is a view showing the arrangement of a 

network system to which an Internet home page 
acquisition method according to the fourth embodiment of 
the present invention is applied. 
Description of the Preferred Embodiments 

25 The present invention will be described in 

detail below with reference to the accompanying drawings. 
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[First Embodiment] 

Fig. 1 shows the arrangement of a network 
3,ste. to which an Internet home page data acguisition 
method according to the present invention is applred. 
and more specifically, an example of a cyclic data 

acquisition system. 

The network system of this embodiment includes 

a client 101 as a wire client and a client 102 as a 
„ireless client, which browse home page data, a publrc 
network 104 to which the client 101 is connected, a base 
station 103 for performing radio co,v™unication with the 
olient 102, a wireless network 105 to which the base 
station 103 is connected, an access server 107 for 
connecting only registered users to an Internet network 
111 through the public network 104, and an access server 
106 for connecting only registered users to the Internet 
network 111 through the wireless network 105. 

This network system also includes a cache URL 
address storage server 109 for storing and managing the 
URL addresses (cache URL addresses, of home pages for 
which cache requests are received from the olient 102, a 
■ cache server 110 which is connected to the Internet 
network 111 to retain retrieved home pages, a local 
network 108 for networking the access servers 107 and 
106, cache server 110, and cache URL address storage 
server 109, and WWW servers 112, 113, and 114 for 
storing home page data and distributing the hypertext 
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contents of the home pages worldwide through the 
Internet network 111. 

Fig. 1 shows only the three WWW servers 112, 
113, and 114 for the sake of descriptive convenience, 
5 even through countless WWW serves are connected to the 
Internet network 111. 

In this embodiment, to retain desired home 
page data in the cache server 110, first of all, the 
users of the clients 101 and 102 create access lists 
10 (cyclic lists) of the URL addresses of home pages which 
the users want to browse. 

These access lists are respectively retained 
in URL list retention folders in the clients 101 and 102. 
The respective URL list retention folders have their own 
15 attributes. Of these folders, one folder in the client 

101 and one folder in the client 102 are defined as 
server storage cyclic folders 101c and 102c, 
respectively. These server storage cyclic folders 101c 
and 102c are used to retain access lists (cyclic lists) 

20 of URL addresses from which the users want to 
collectively cache data. 

Browsers 101a and 102a in the clients 101 and 

102 create list data conforming to the format of the 
cache URL address storage server 10 9 on __the basis of the 

25 access lists retained in the server storage cyclic 

folders 101c and 102c. The browsers 101a and 102a of 
the clients 101 and 102 transfer the created access 
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lists of cache URL addresses to the cache URL address 
storage server 109 at the time of line connection, and 
then disconnect the lines. 

The cache server 110 searches for home pages 
5 on the basis of the access lists transferred to the 

cache URL address storage server 109, and sequentially 
retains home page data upon finding the target URL 
addresses. When all the home page data are retained, 
the cache server 110 connects lines to the clients 101 
10 and 102 and issues data acceptance requests to the 
clients 101 and 102. Upon reception of the data 
acceptance requests, the clients 101 and 102 accept the 
home page data from the cache server 110 and retain them 
in caches 101b and 102b. 
3_5 According to another method, the clients 101 

and 102 periodically check the retained state of cache 
data in the cache server 110, and accept only cached 
data. In this case, on the basis of periods of time 
required for the acquisition of home page data and the 
20 like, the cache server 110 notifies the clients 101 and 
102 of the times at which data should be accepted. This 
makes it possible to avoid an unnecessary increase in 
traffic on the network. 

The cache server 110 also caches home page 
25 data in accordance with requests from the clients 101 
and 102. If, therefore, the cache server 110 retains 
all the home page data requested by the clients 101 and 
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102 for the same period of time, the memory of the cache 
server 110 will be wasted. 

In order to solve this problem, different data 
retention times are set in accordance with the numbers 
5 of accesses to the respective home pages. In this case, 
the data of home pages to which larger numbers of 
accesses are made are retained for longer times. With 
regard to home pages which are accessed little, the data 
are discarded when the clients 101 and 102 accept them. 
10 This makes it possible to further reduce the traffic on 
the network. 

According to this embodiment, the clients 101 
and 102 send access lists of URL addresses of home pages, 
from which the clients want to accept data, to the cache 
15 URL address storage server 109, and the home page data 
are acquired by the cache server 110,- which is always 
connected to the network, and transferred to the clients 
101 and 102. With this operation, the home page data 
can be acquired in short connection times. 
20 In addition, it suffices if access lists 

(cyclic lists) of URL addresses to be circulated are 
stored in folders, in the clients 101 and 102, to which 
a special meaning like "dedicated to circulation" is 
assigned. The access lists in the folders are 
25 transferred to the cache URL address storage server 109 
at the time of line connection. The cache server 110 
then acquires home page data on the basis of the 
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transferred access lists. This .a.es it possible to 

Shorten the v,ait ti.e in ho.e page address search 

-.v,^ line connection time, 

operation and the J-ine 

[Second Embodiment] 

■h^ Fia 2, reference numeral 201 
Referring to Fig- ^. 

,,„„tes a Client including a .ro«ser .Ola .or .rcwsin. 

omh- 202, a local network to 
Hai-a and a cache 201D, ^u^, 
home page data ana d 

„hich the Client 201 is connected,- 203, a cache 
.aaress storage server connected to the local networ. 
.02. 204, a cache server connected to the local networ. 
202- 205, an Internet net»or. to which the cache server 
.O,' 13 connected; and 206, a server connected to the 

internet network 205. 

The cache URL address storage server 203 
causes the cache server 204 to acquire ho.e page data 

.cyclic list, o£ cache OKL addresses transferred from 
the client 201- 

• ^\r. ^ the cache server 204 
As shown in Fig. i> i^"^ 

includes a target address access rate detecting section 
.04a for Checking the access state of UK. — 
he circulated, an elapsed time measuring section 204b 
.or measuring the retention time of cache data, a home 
page data cache section 204c for storing acguired home 
page data, and a cache comparing section 204d for makrng 
a comparison to check whether the cached data is the 
latest data. 



15 

from 
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As shown in Fig. 4, the cache URL address 
storage server 203 includes a cyclic URL address 
accepting unit 203a for accepting cache URL addresses 
(cyclic URL addresses) requested by a client, a cyclic 
URL address list retaining section 203b for retaining an 
accepted access list (cyclic URL address list) of cache 
URL addresses for each client, and a client ID managing 
section 203c for checking the ID of each client, 
managing information for ensuring an area for caching 
data for each client, and managing a data acceptance 
requesting method for each client. 

An access list of cache URL addresses 
transferred from the client 201 to the cache URL address 
storage server 203 is managed in units of folders. 
Fig. 5 shows the folder structure of an access list. 

As shown in Fig. 5, the access list of URL 
addresses is managed by dividing a folder into a server 
storage cyclic folder HI and client storage cyclic 
folder H2. The access list stored in the server storage 
cyclic folder HI is transferred as an access list 
(cyclic list) of cache URL addresses to the cache URL 
address storage server 203. For the sake of descriptive 
convenience, two folders are used in this case, although 
the number of folders is not limited. 

When the cache server 204 is to be caused to 
actually perform circulation, the address information of 
the corresponding URL addresses is copied to the server 
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storage cyclic folder HI. When the client 201 is to be 
caused to actually perform circulation, the address 
information of the corresponding URL addresses is copied 
to the client storage cyclic folder H2. 

At the time of line connection, the client 201 
transfers the address information of "URLll", "URL12", 
and "URL13" stored in the server storage cyclic folder 
HI to the cache URL address storage server 203. Upon 

reception of this information, the cache URL address 
storage server 203 prepares for the acquisition of 

information. 

Fig. 6 shows management information (access 

list) in the cache URL address storage server 203. The 
cache URL address storage server 203 manages a cache 

state for each client. 

Referring to Fig. 2, the Internet network 205 
and WWW server 206 are known well by the person skilled 
in the art, and hence a description of the detailed 
arrangements will be omitted. 

The operation of the above network system will 
be described next with reference to the flow chart of 
Fig. 7. 

At the time of line connection (step S701) , 
the client 201 connects to the cache URL address storage 
server 203 (step S702) . After connection to the cache 
URL address storage server 203, the client 201 extracts 
cache URL addresses to be circulated from the server 
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storage cyclic folder HI, and transfers them to the 
cache URL address storage server 203 (step S703) . 

Upon completion of the transfer of the URL 
addresses, the client 201 temporarily disconnects from 
5 the cache URL address storage server 203 (step S704), 

and disconnects the line (step S705) . The cache server 

204 acquires and caches home page data from home pages 
at the target URL addresses through the Internet network 

205 on the basis of the access list of cache URL 

10 addresses stored in the cache URL address storage server 
203. 

Obviously, data having the same date and size 
as those of data existing in the cache server 204 are 
regarded as identical data and hence are not acquired 
15 from the WWW server 206, and only new data are stored. 

If, however, all the data of accessed home 
pages are stored, a shortage of memory capacity occurs. 
For this reason, the retention time of cache data is 
changed in accordance with the access rate of each home 
20 page. 

The retention period of the data of a home 
page with a high access rate is set be longer than that 
of the data of a home page with a low access rate. This 
makes it possible to efficiently cache data. Note that 
25 the access rate of each home page may be calculated on 
the basis of the frequency of occurrence of URLs with 
which the client 201 requests the acquisition of data. 
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According to this embodiment, the cache server 
204 that is always connected to the Internet has the 
data acquisition function, and the client 201 requests 
the cache server 204 through the cache URL address 
5 storage server 203 to acquire home page data. When all 
home page data are acquired, the client 201 accepts the 
data from the cache server 204. With this operation, 
the wait time in accessing the WWW server can be 
shortened, thus shortening the connection time of a 
10 telephone line or the like. 

In addition, a folder in the client 201 is 
assigned a special meaning like "dedicated to 
circulation", and operation is performed according to 
the meaning assigned to the folder at the time of line 
15 connection, thereby easily setting URL addresses to be 
circulated. 
[Third Embodiment] 

In the third embodiment, a further improvement 
is made on the method of transferring URL addresses to a 
20 cache URL address storage server 203. More specifically, 
as shown in Fig. 8, the message "Please input the 
addresses of home pages that you want to circulate" is 
displayed on an Internet window on a client 201 to allow 
the user to set URL addresses on a WWW home page. Note 
25 that the basic arrangement of the third embodiment is 
the same as that of the second embodiment, and hence a 
description thereof will be omitted. 
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According to this embodiment, by setting URLs 



on the home page, the user can set URL addresses by 
using a general WWW browser without requiring dedicated 
software for sending cache URL addresses to the cache 
5 URL address storage server 203. Note that the method of 
accepting stored URL data is the same as that used in 
the second embodiment . 
[Fourth Embodiment] 



10 associated with the connection state of clients and 

cache servers. As shown in Fig. 9, a public network 904 
and wireless network 905 respectively have cache servers 
909 and 913 separately connected to an Internet network 
914. This arrangement makes it possible to transfer 

15 data to clients 901 and 902 by methods suitable for the 
respective networks. 



designed to perform radio communication may have 
difficulty in accepting data depending on the state of a 

20 radio channel between the client and a disk unit 903. 
For this reason, data reduction and the like are 
executed in the cache server 913. This allows the 
client 902 to accept data in a shorter period of time 
than in accessing through the public network 904. In 

25 this manner, a data transfer method can be optimized in 
accordance with a network environment. 



Fig. 9 shows still another embodiment 



In some case, for example, the client 902 



Reference numerals 901a and 902a denote 
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browsers; 901b and 902b, caches; 906 and 910, access 
servers; 907 and 911, local networks; 908 and 912, cache 
URL address storage servers; and 915, 916, and 917, WWW 
servers . 

5 As has been described above, according to the 

present invention, since a cache server that is always 
connected to the network is made to acquire home page 
data, the home page data can be acquired in a short 
connection time. This can shorten the unnecessary wait 

10 time when a client connects to a WWW server. 

In addition, since an access list to be 
transferred to a cache URL address storage server is 
retained in a server storage cyclic folder in a client, 
the data to be transferred to the cache URL address 

15 storage server are specified. This makes it possible to 
facilitate circulation. 
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